ABSTRACT

Aims. Our aim is to study the large-scale structure of different types of AGN using the medium-deep XMM-LSS survey.
Methods. We measure the two-point angular correlation function of ~5700 and 2500 X-ray point-like sources over the ~11 sq. deg.
XMM-LSS field in the soft (0.5-2 keV) and hard (2-10 keV) bands. For the conversion from the angular to the spatial correlation
function we used the Limber integral equation and the luminosity-dependent density evolution model of the AGN X-ray luminosity
function.

Results. We have found significant angular correlations with the power-law parameters $\gamma = 1.81 \pm 0.02$, $\theta_0 = 1.3'' \pm 0.2''$
for the soft, and $\gamma = 2.00 \pm 0.04$, $\theta_0 = 7.3'' \pm 1.0''$ for the hard bands. The amplitude of the correlation function
$w(\theta)$ is higher in the hard than in the soft band for $f_x < 10^{-14}$ erg s$^{-1}$ cm$^{-2}$ and lower above this flux limit.
We confirm that the clustering strength $\theta_0$ grows with the flux limit of the sample, a trend which is also present in the
amplitude of the spatial correlation function, but only for the soft band. In the hard band, it remains almost constant with
$r_0 \approx 10\,h^{-1}$ Mpc, irrespective of the flux limit. Our analysis of AGN subsamples with different hardness ratios shows
that the sources with a hard spectrum are more clustered than soft-spectrum ones. This result may be a hint that the two main
types of AGN populate different environments. Finally, we find that our clustering results correspond to an X-ray selected AGN
bias factor of ~2.5 for the soft band sources (at a median $z \simeq 1.1$) and ~3.3 for the hard band sources (at a median
$z \simeq 1$), which translates into a host dark matter halo mass of $\sim 10^{13}\,h^{-1} M_\odot$ and $\sim 10^{13.7}\,h^{-1} M_\odot$
for the soft and hard bands, respectively.

Key words. X-rays: galaxies - galaxies: active - surveys 

1. Introduction

The study of the large-scale structure of the Universe and of structure formation processes makes it
necessary to carry out wide-field surveys of extragalactic objects. These surveys are performed in
almost all accessible wavelength bands. X-ray surveys constitute an important part of these surveys
because of the weak absorption at such high energies. The most recent and prominent observational
X-ray results have been obtained with the XMM-Newton and Chandra space observatories
(Brandt & Hasinger 2005). More than 95% of all detected objects in X-ray surveys away from the
galactic plane are point-like and predominantly active galactic nuclei (AGN), the rest are mostly
extended sources (groups and clusters of galaxies and relatively nearby galaxies). Owing to their
high X-ray luminosity, AGN can be detected over a wide range of redshifts in contrast to normal
galaxies (Hartwick & Schade 1989), and therefore these objects are excellent tracers of the cosmic
web and a convenient tool for studying evolutionary phenomena in the Universe. It is known that the
optical and X-ray classifications of type 2 (obscured) AGN agree quite well, see for example
Garcet et al. (2007) and references therein. X-ray selected AGN also provide a relatively unbiased
census of the AGN phenomenon because obscured AGN, which are largely missed in optical surveys, are
included in X-ray surveys.

The clustering pattern of the AGN population can provide important information regarding the
cosmography of matter density fluctuations at different scales and the cosmological parameters
(e.g., Hickox et al. 2007; Engels et al. 1999; Plionis et al. 2010; Ebrero et al. 2009;
Basilakos & Plionis 2009, 2010), the evolution of the AGN phenomenon (e.g., Comastri & Brusa 2008;
Koulouridis et al. 2009; Allevato et al. 2011), the relation between AGN activity and their dark
matter halo hosts, supermassive black hole formation (e.g., Mandelbaum et al. 2009;
Miyaji et al. 2011; Allevato et al. 2011), and so on. The most common approach to quantify AGN
clustering without redshift information is to measure the AGN two-point angular correlation
function (ACF; Akylas et al. 2000; Yang et al. 2003; Manners et al. 2003; Basilakos et al. 2005;
Gandhi et al. 2006; Puccetti et al. 2006; Miyaji et al. 2007; Carrera et al. 2007;
Garcet et al. 2007; Ebrero et al. 2009), which provides an estimate of how significant the excess
of AGN pairs is, within some projected angular separation, over that of a random distribution.
Once the angular correlation function is measured, it is possible to reconstruct the spatial
clustering, under some specific assumptions, using the Limber integral equation (Limber 1953;
Peebles 1980). However, spectroscopic follow-up as well as multiwavelength photometric
observations in a number of different bands allow us to measure or estimate redshifts for a large
number of AGN and to apply the direct spatial correlation analysis (e.g. Gilli et al. 2005;
Yang et al. 2005; Gilli et al. 2009; Coil et al. 2009; Cappelluti et al. 2010; Miyaji et al. 2011).

Send offprint requests to: A. Elyiv

* This paper is dedicated to the memory of Olivier Garcet who has initiated the present work just
before his sudden death.

Clustering analyses of the various surveys of X-ray selected 
AGN in the soft and hard bands have provided a wide range of 
angular and spatial clustering lengths. Strong indications for a 
flux-limit clustering dependence appear to reconcile most of the
diverse results, however (Plionis et al. 2008; Ebrero et al. 2009;
Krumpe et al. 2010).

Another important question is whether the clustering of X-ray
selected AGN evolves with time. Gilli et al. (2009) did not
find any significant difference between the X-ray AGN clustering
below and above z = 1. Even so, the X-ray AGN bias
factor should evolve with time, and indeed Yang et al. (2006)
found a rapid increase of the bias factor with redshift, with
$b(z = 0.45) = 0.95 \pm 0.15$ and $b(z = 2.07) = 3.03 \pm 0.83$.
Similarly, Allevato et al. (2011) estimated the average bias in
the COSMOS AGN survey and found a redshift evolution of the
bias factor with $b(z = 0.92) = 2.30 \pm 0.11$ and
$b(z = 1.94) = 4.37 \pm 0.27$.

The AGN clustering pattern can also be used to test the
unification model, because obscured and unobscured AGN should
have identical correlation functions if the orientation of the
torus is the only determining factor of the AGN phenomenology.
Gilli et al. (2009) used the 2 sq. deg. XMM-COSMOS field and did
not find any significant difference in the spatial distribution
of the broad and narrow line AGN. Similarly, Ebrero et al. (2009),
studying 1063 XMM-Newton observations, found consistent
correlation properties for sources with high and low hardness
ratios, which mostly correspond to obscured and unobscured AGN,
respectively. These results suggest that obscured and unobscured
objects populate similar environments, which agrees with the
unified model of AGN.

However, the analysis of the 9 sq. deg. Bootes multiwavelength
survey showed slightly different clustering properties
for the two types of AGN (Hickox et al. 2011). Similarly,
Puccetti et al. (2006) investigated the central 0.6 sq. deg. region
of the ELAIS-S1 field and found that the correlation amplitude in
the hard band ($\theta_0 = 12.8'' \pm 7.8''$) is 2.5 times higher than that in
the soft band ($5.2'' \pm 3.8''$), but with a weak significance ($\sim 1\sigma$).
Gandhi et al. (2006) used the hardness ratio (HR) and divided
the point-like sources into mainly obscured (HR > -0.2) and
unobscured (HR < -0.2) subsamples, finding a positive clustering
signal only for the obscured sources in the hard band.

In this work we will revisit these questions by presenting
the final results of the point-like source distribution of the
XMM-Newton Large Scale Structure (XMM-LSS) survey of
Pierre et al. (2004). In Gandhi et al. (2006) we presented the
AGN clustering results based on the previous release of 4.2 sq.
deg. of this survey. A weak positive correlation signal was found
in the soft band (angular scale $\theta_0 = 6.3'' \pm 3''$ with a slope
$\gamma = 2.2 \pm 0.2$). At present the full XMM-LSS field is one of
the widest (~11 sq. deg.) medium-deep surveys. It is part of an
even larger project, the XXL, observations of which are currently
being implemented (Pierre et al.).

In the following sections we present the description of the 
XMM-LSS survey (Section 2). Results of the numerical simu- 
lations of the X-ray point-like sources are presented in Section 
3. Basic properties of the XMM-LSS field, like its source dis- 
tribution on the sky and the point-source log N - log S relation, 
are shown in Section 4. Section 5 includes the method used to 
produce the random catalogs and the ACF analysis for the dif- 
ferent samples. Inverting from angular to spatial clustering and 
the derived bias of AGN are presented in Sections 6 and 7, re- 
spectively, while the main conclusions are listed in Section 8. 

2. The sample of X-ray point-like sources 

In the present correlation function analysis, we have used point- 
like X-ray sources from the XMM-LSS field, which consists 
of 87 pointings with maximum available exposures from 10 
to 28 ks. We also used 7 pointings of the independent, deeper
Subaru/XMM-Newton Deep Survey (SXDS; Ueda et al. 2008),
whose data we reanalyzed with our pipeline because it is fully
enclosed in the XMM-LSS area, although with a different spacing
pattern. For the S01 pointing of SXDS we kept only a 40 ks chunk
to prevent possible source confusion.

Altogether, the XMM-LSS field is contiguous and contains 
~ 5700 sources in the soft (0.5 - 2 keV) band and ~ 2500 in 
the hard (2-10 keV) band, out of which ~ 180 are extended 
(mainly galaxy clusters). Although all extended sources were removed
from our analysis, it is interesting to note that they were
categorized according to their extension likelihood$^{1}$ (EXTlike)
and core radius (EXT) into two classes: "C1", which are the true
extended sources with EXTlike > 33 and EXT > 5", containing
54 objects, and "C2", which is a class with almost 50 per
cent contamination at 15 < EXTlike < 33 and EXT > 5",
containing 129 objects (see Pacaud et al. 2006 for details).

Separately, we have made use of a more uniform XMM-LSS 
survey for our analysis which consists of 10 ks chunks. This cat- 
alog will be published soon (Chiappetti et al., in prep.) 

We considered all point-like sources as AGN, although we
do expect a ~3% stellar contamination (Salvato et al. 2009).
More details about the source classification will be given in
Melnyk et al. (in prep.). The sensitivity limits of the joined sample
are near $10^{-15}$ and $3 \times 10^{-15}$ erg s$^{-1}$ cm$^{-2}$ for the soft and hard
bands, respectively.

The average distances between the centers of adjacent point- 
ings were substantially shorter than the FoV diameter of the 
EPIC cameras to gain a more homogeneous coverage. This 
caused overlaps between adjacent pointings. Because each 
pointing was processed individually, the final merged catalog 
was produced a posteriori (Pierre et al. 2007; Chiappetti et al., in 
prep). As a first possibility, we only considered sources with an 
off-axis distance < 10'. In this way, we did not have to consider 
boundary sources that are often detected with large errors. The 
total effective area of the fields was 8.3 sq. deg. The distribu- 
tion of the corresponding 4066 X-ray sources located within the 
borders is shown in Fig. 1. As an alternative merging possibility
we applied a Voronoi boundary delimitation (Matsuda & Shima 1984)
without imposing an off-axis cutoff and using sources
across the full exposure XMM-LSS field. In other words, in any
overlap region between two pointings we only kept those detections
in our final catalog that had the shortest off-axis distance.
In this way, we were able to use the widest possible area, 10.9 sq.
deg., of the investigated field. The distribution of the 5093
point-like X-ray sources is shown in Fig. 2. The basic ACF results were
checked considering both approaches and it was found that, apart
from larger uncertainties in the case of the < 10' delimitation
method, the results were statistically identical.

$^{1}$ By the term "likelihood" as used hereinafter, we mean formally the
log-likelihood.

Fig. 1. Distribution of the X-ray point-like sources observed in
the soft band within the whole XMM-LSS field with an off-axis
distance less than 10'. The red circles represent the borders
between the different pointings. Note that even when using the 10'
limitation, we may have some overlapping regions. We discarded
these minor overlaps using the Voronoi tessellation method.



3. Simulations of XMM-LSS AGN 

For a proper correlation function analysis we need to know the 
values of the detection probability for each registered source. 
Therefore we performed extensive simulations of each individ- 
ual XMM pointing. We briefly describe our procedure and the 
set of simulations that were used for the representation of the 
XMM-LSS field. 



3.1. Description of the XMM-Newton point-source simulation 

The principle of the simulations is similar to the one presented
in Pacaud et al. (2006) and Gandhi et al. (2006). The main steps
of the procedure consist in i) generating an input source list
drawn from a fiducial flux distribution that is randomly distributed
across the pointing field of view; ii) simulating images
of the field as it would be seen by XMM-Newton by reproducing
the main instrumental effects (vignetting, PSF distortion, detector
masks, background and Poisson noise); iii) detecting sources
with the XMM-LSS pipeline (Pacaud et al. 2006) and obtaining
their likelihoods and measured count-rates$^{2}$; iv) correlating
the detected source list with the input catalog using a 6" radius
and deriving the rates of true and false detections as well as the
detection probabilities.

Fig. 2. Distribution of the X-ray point-like sources observed in
the soft band within the whole XMM-LSS field with the Voronoi
tessellation method. Note that using the Voronoi tessellation we
did not exceed the 13' off-axis distance for any pointing.
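Step iv) of the simulation procedure, the positional correlation of detections with the input list, can be illustrated with a minimal sketch. The function names, the flat-sky separation formula and the greedy nearest-match rule below are assumptions made for illustration; they are not taken from the XMM-LSS pipeline. Only the 6" matching radius comes from the text.

```python
import math

MATCH_RADIUS_ARCSEC = 6.0  # correlation radius quoted in the text


def separation_arcsec(a, b):
    """Small-angle separation between two (ra_deg, dec_deg) positions."""
    mean_dec = math.radians(0.5 * (a[1] + b[1]))
    d_ra = (a[0] - b[0]) * math.cos(mean_dec)
    d_dec = a[1] - b[1]
    return math.hypot(d_ra, d_dec) * 3600.0


def cross_match(inputs, detections, radius=MATCH_RADIUS_ARCSEC):
    """Return (set of recovered input indices, list of spurious detection indices)."""
    recovered, spurious = set(), []
    for j, det in enumerate(detections):
        best, best_sep = None, radius
        for i, src in enumerate(inputs):
            sep = separation_arcsec(src, det)
            # keep the closest input source not already claimed by a detection
            if sep <= best_sep and i not in recovered:
                best, best_sep = i, sep
        if best is None:
            spurious.append(j)  # no input counterpart: false detection
        else:
            recovered.add(best)
    return recovered, spurious


def detection_probability(inputs, detections):
    """Fraction of input sources recovered, and the number of false detections."""
    recovered, spurious = cross_match(inputs, detections)
    p_det = len(recovered) / len(inputs) if inputs else 0.0
    return p_det, len(spurious)


# Toy example: three input sources, one recovered, one spurious detection.
inputs = [(36.00, -4.5), (36.01, -4.5), (36.02, -4.5)]
detections = [(36.00 + 2.0 / 3600.0, -4.5), (36.05, -4.5)]
p_det, n_false = detection_probability(inputs, detections)
```

In practice the same ratio would be accumulated per bin of input count-rate, exposure time, background ratio and off-axis angle, which is how the probability curves discussed later are built.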

The simulations were performed in the soft and the
hard bands. The original source distribution was taken from
Moretti et al. (2003) using either their soft or hard band fitting
formulae, down to a flux which approximately corresponds to 2
photons on-axis (i.e. below the XMM-Newton detection limit). This
value depends on the exposure time chosen for each particular
simulation. Non-resolved AGN photon background was added
following Read & Ponman (2003), then we subtracted the contribution
of the AGN resolved by our detection algorithm. The
constant conversion factor cf between the total count-rates and
the physical fluxes S was calculated on the basis of the MOS and
PN camera factors provided by Pierre et al. (2007).

Particle background was also added according to values
quoted in Read & Ponman (2003) and was subsequently modified
by multiplying these values by an arbitrary factor between
0.1 and 8 to allow for pointing-to-pointing background variations.
In any case, this component was not vignetted. We summarize
our typical background values in Table 1. The PSF model
was taken from the XMM-Newton medium model calibration
files. The vignetting was modeled through its off-axis variation
onto each detector.

The detection algorithm provides for each source an estimate
of its count-rate on each detector as well as the local background
value at the source position. A key parameter is the source
detection likelihood. Following Pacaud et al. (2006), this quantity
was computed using the C-statistic. Its value is the difference
between the likelihood of the best-fitting point-source model and
the likelihood of a pure background fluctuation. As such, the
source likelihood LH represents the significance of the detection.
A value of 15 provides a good balance between contamination
and completeness (see Pacaud et al. 2006 and Sect. 3.2.1 for
a discussion of the stability of this criterion).

$^{2}$ Throughout this paper, count-rates are expressed in terms of total
MOS1+MOS2+PN count-rates, corrected for vignetting. That is why
two sources with the same count-rate but different off-axis positions
will have different probabilities of detection.

            Photon background         Particle background
Detector    soft band   hard band     soft band   hard band
MOS1        1.21        1.77          0.764       1.16
MOS2        1.32        1.88          0.730       1.09
PN          2.49        3.55          2.80        6.03

Table 1. Typical background values for the pointing simulations.
We allowed the particle background to vary from one pointing to
the other through a multiplicative factor chosen among 0.1, 0.25,
0.5, 1, 2, 4 and 8. Units for each XMM-Newton detector are $10^{-6}$
cts s$^{-1}$ pixel$^{-1}$.

T_exp    Limiting flux                              Particle background     Number
(ks)     ($10^{-15}$ erg s$^{-1}$ cm$^{-2}$)        factors                 of fields

Soft band 0.5 - 2 keV
7        1.43                                       0.1 0.25 0.5 1 2 4 8    7 x 540
10       1                                          0.1 0.25 0.5 1 2 4 8    7 x 540
20       0.5                                        0.1 0.25 0.5 1 2 4 8    7 x 540
40       0.25                                       0.1 0.25 0.5 1 2 4 8    7 x 540
80       0.125                                      0.1 0.25 0.5 1 2 4 8    7 x 540

Hard band 2 - 10 keV
7        14.3                                       0.1 1 3                 3 x 540
10       10                                         0.1 1 3                 3 x 540
20       5                                          0.1 1 3                 3 x 540
40       2.5                                        0.1 1 3                 3 x 540

Table 2. Summary of the soft and hard band simulation sets. The
second column refers to the lowest flux of the input simulated
sources in the band of interest. Different background levels are
accounted for by applying a multiplicative factor to the values
from Table 1. This set of simulations encompasses most of the
XMM-LSS pointing characteristics.

3.2. Set of simulations 

To fully account for the variations of the detection efficiency
across the XMM-LSS fields, we simulated 18900 and 6480
pointings in the soft and the hard bands, respectively. Table 2
details the simulation set. Fig. 3 illustrates the influence of
exposure time and background ratio value for three pointings from
our simulation set at 10 and 40 ks and for background ratios 1
and 4.

Thanks to the high number of simulated fields, we were able
to bin our results by source off-axis angle. We have chosen six
annuli of equal area to obtain approximately the same level of
significance in each bin. Values defining the bin bounds are 0,
5.3, 7.5, 9.2, 10.6, 11.9 and 13.0 arcmin.
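The equal-area bin bounds quoted above follow from requiring that each annulus covers the same solid angle, so that the k-th bound scales as the square root of k. A short sketch (the function name is ours, introduced only for illustration) reproduces them:

```python
import math


def equal_area_bounds(r_max, n_bins):
    """Radii splitting a disc of radius r_max into n_bins annuli of equal area.

    The area inside radius r grows as r**2, so the k-th bound is
    r_max * sqrt(k / n_bins).
    """
    return [r_max * math.sqrt(k / n_bins) for k in range(n_bins + 1)]


# Six annuli out to the 13 arcmin maximum off-axis angle used in the text.
bounds = equal_area_bounds(13.0, 6)
rounded = [round(b, 1) for b in bounds]
```

Rounded to one decimal place, the result matches the bounds listed in the text (0, 5.3, 7.5, 9.2, 10.6, 11.9 and 13.0 arcmin).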




Fig. 5. Efficiency of our source detection algorithm in the innermost
annulus (0-5 arcmin) of the simulated soft band XMM observations,
as a function of the input MOS1+MOS2+PN count-rate (or equivalently,
flux for a typical AGN spectrum and a galactic hydrogen
column density fixed to $2.6 \times 10^{20}$ cm$^{-2}$). The exposure time
differs from one curve to the other (7, 10, 20, 40 and 80 ks), but
not the background rate.

3.2.1. Completeness/contamination balance

The source selection was based on the detection likelihood value
(ML); all detected sources with ML > 15 were included in the
final sample of point-like sources. A fraction of these sources
comes from false detections. Fig. 4 shows the distribution of the
recovered sources for three configurations in the innermost off-axis
bin (0-5.3 arcmin). We see little dependence of the contamination
rate on the background level and exposure time, and the
ML = 15 threshold appears as the best choice for a homogeneous
balance between completeness of the sample and contamination
by spurious sources.

We numerically computed the rate of false detections as a
function of the off-axis angle, background level and exposure
time in every configuration. The average rate of spurious detections
is between 2 and 5 per pointing (up to 13 arcmin off-axis
angle). A typical pointing ($T_{\rm exp}$ = 10 ks, background ratio = 1)
gives from 40 to 50 detections in the [0 - 10] arcmin off-axis range,
which leads to a contamination rate of about 5%.

3.2.2. Detection efficiency as a function of pointing
characteristics

We show in Figs. 5, 6 and 7 the probability curves derived from
our simulations. These curves were computed by dividing the
number of detected (ML > 15) sources by the number of input
sources in a given input count-rate bin and for a given exposure
time, background ratio and off-axis bin. The detection efficiency
is close to a flux-limited efficiency, whose limit depends
on the local pointing characteristics. A strong dependence
on the off-axis position is noticeable in Fig. 7 because the effects
of vignetting and PSF distortions grow with off-axis
distance. The exposure time dependence (Fig. 5) is compatible
with a $\propto \sqrt{T_{\rm exp}}$ improvement factor in the signal-to-noise
ratio, while the background level has a milder influence on the
detection efficiency. In a typical pointing ($T_{\rm exp}$ = 10 ks, background
ratio = 1) the flux limit is $2.5 \times 10^{-15}$ ($4 \times 10^{-15}$) erg s$^{-1}$
cm$^{-2}$ at 50% (90%) completeness.


3.2.3. Relating real data to simulations 

Sensitivity maps across the entire XMM-LSS field can be derived
through interpolation between simulated pointings. The
exposure time of a given pointing is a straightforward quantity,
as is the off-axis angle at the position of a source. To relate the
background ratio quantity to real data, we used estimates of the
local background fitted by our detection algorithm at each detected
source position (see Pacaud et al. 2006 for a description of
the fitting procedure). Estimated numbers of background counts
per pixel are output as two quantities, PNT_BG_MAP_MOS
and PNT_BG_MAP_PN. Fig. 8 shows the relationship between
the input background ratio and these quantities as derived from
simulations. As expected, local background estimates computed
by the detection algorithm are well correlated with the background
ratio values introduced in the simulations. We fitted the
local background values by PNT_BG_MAP_MOS(PN) using the
least-squares method separately for each pointing. With this we
determined the background ratio level B, which corresponds to
the best parameter of the fitting.

Fig. 3. Examples of simulated XMM-Newton pointings in the soft band. The large blue circle indicates the region in which the
source detection is performed (13 arcmin maximal off-axis angle). Green boxes indicate the positions of sources detected with a
likelihood ML > 15. Left: $T_{\rm exp}$ = 10 ks, background ratio = 1; Middle: $T_{\rm exp}$ = 10 ks, background ratio = 4;
Right: $T_{\rm exp}$ = 40 ks, background ratio = 1.

Fig. 4. Examples of likelihood versus measured MOS1+MOS2+PN count-rate (cts/s) plots for three combinations of pointing exposures
and background ratios from our soft band simulation set. Green symbols show detections with a real input counterpart while red
points indicate spurious (false) detections. All sources within 5 arcmin from the pointing center are shown here. Left: 540
pointings with $T_{\rm exp}$ = 10 ks, background ratio = 1; Middle: 540 pointings with $T_{\rm exp}$ = 10 ks, background ratio = 4;
Right: 540 pointings with $T_{\rm exp}$ = 40 ks, background ratio = 1. The horizontal line corresponds to ML = 15, the threshold
above which detected sources are included in the catalog. The separation between false and real detections is relatively
independent of the pointing quality.
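The determination of the background ratio B of a real pointing from its local background estimates (Sect. 3.2.3) can be sketched as a least-squares comparison with the simulated relations. The sketch below is an assumption-laden illustration: it restricts B to the simulated grid, compares values in log space, and uses made-up template numbers; the lookup table `templates` and the function name are hypothetical and do not come from the pipeline.

```python
import math


def best_background_ratio(measured, templates):
    """Pick the simulated background ratio whose curve best fits the data.

    measured  : list of (offaxis_bin_index, local_background) pairs, one per
                detected source in the pointing
    templates : dict mapping a background ratio B to the simulated mean
                background in each off-axis bin (hypothetical lookup table)
    """
    def sum_sq(ratio):
        curve = templates[ratio]
        # least squares on log values, since backgrounds span decades
        return sum((math.log10(value) - math.log10(curve[k])) ** 2
                   for k, value in measured)
    return min(templates, key=sum_sq)


# Toy templates: a vignetted photon term plus a flat particle term scaled by B.
photon = [1.0, 0.9, 0.8, 0.7, 0.6, 0.5]
ratios = [0.1, 0.25, 0.5, 1.0, 2.0, 4.0, 8.0]
templates = {b: [p + b for p in photon] for b in ratios}

# Fake measurements drawn from the B = 2 curve with a 5% offset.
measured = [(k, 1.05 * templates[2.0][k]) for k in range(6)]
best = best_background_ratio(measured, templates)
```

A continuous fit would interpolate between the simulated curves instead of choosing the nearest grid value; the grid version is shown only because it keeps the example short.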

4. Sky coverages and log N-log S distributions

An important characteristic of an X-ray survey is the sky cov- 
erage or, in other words, the effective area curve. This indicates 
the maximum effective area over which we can detect sources 
brighter than some given flux limit. We have constructed the area 
curves using the numerically calculated probabilities p to detect 
sources with a certain flux S , an off-axis distance R in a pointing 
with some effective exposure T and particle background level B. 



The effective area A(S) is calculated by integrating over the
whole field area $\Omega$:

$$A(S) = \int_{\Omega} p(S, R, T, B)\, d\Omega. \qquad (1)$$



Fig. 9 shows the effective area curves for the investigated samples
in the soft and hard bands, with a minimum flux of $10^{-15}$ for
the soft and $3 \times 10^{-15}$ erg s$^{-1}$ cm$^{-2}$ for the hard bands. For the
construction of the illustrated area curves we used the Voronoi
tessellation delimitation method.
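Numerically, the effective area integral of Eq. (1) becomes a sum of detection probabilities over small sky cells. The sketch below illustrates this under stated assumptions: the detection probability function is a toy placeholder (the real p(S, R, T, B) comes from the simulations), and the cell list and all names are hypothetical.

```python
import math


def toy_detection_probability(flux, offaxis, exposure, background):
    """Placeholder for the simulated p(S, R, T, B); NOT the paper's tables."""
    if flux <= 0.0:
        return 0.0
    # toy flux limit: deeper with exposure, shallower with background/off-axis
    limit = 2.5e-15 * math.sqrt(10.0 / exposure) * math.sqrt(background)
    limit *= 1.0 + (offaxis / 13.0) ** 2
    return 1.0 / (1.0 + (limit / flux) ** 4)


def effective_area(flux, cells, p_detect=toy_detection_probability):
    """Discrete version of A(S): sum of cell area times detection probability."""
    return sum(
        c["area"] * p_detect(flux, c["offaxis"], c["exposure"], c["background"])
        for c in cells
    )


# Three sky cells (area in sq. deg., off-axis in arcmin, exposure in ks).
cells = [
    {"area": 0.01, "offaxis": 2.0, "exposure": 10.0, "background": 1.0},
    {"area": 0.01, "offaxis": 8.0, "exposure": 10.0, "background": 2.0},
    {"area": 0.01, "offaxis": 12.0, "exposure": 20.0, "background": 1.0},
]
curve = [(s, effective_area(s, cells)) for s in (1e-16, 1e-15, 1e-14, 1e-12)]
```

With any reasonable detection probability the curve rises monotonically from zero at faint fluxes to the full geometric area at bright fluxes.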

Fig. 10 shows a comparison between the normalized effective
area curves of various recent X-ray surveys. The effective
area curve as a function of flux depends mainly on the depth
of the source detection (indicated by the signal-to-noise ratio or
likelihood thresholds). It also depends on the distributions of the
pointing exposures, particle background level and the procedure
of handling the pointing overlaps. Evidently, the COSMOS field
has the lowest flux limit and the steepest area curve among the
considered surveys, with the likelihood limit for the source
detection being equal to 6 (Cappelluti et al. 2007). Our full
exposure XMM-LSS survey, having a significant fraction of the
contributing pointings with exposures between 10 and 15 ks and
a source detection threshold of ML = 15, has the next lowest
flux limit after the COSMOS survey, and a quite steeply increasing
area curve. The corresponding 10 ks XMM-LSS field has its
area curve shifted to the right and its flux limit increased by a
factor of ~1.2.

Fig. 8. Pipeline-estimated background values (PNT_BG_MAP_MOS and PNT_BG_MAP_PN, T = 10 ks) on MOS (left) and PN (right)
detectors as a function of off-axis angle (arcsec), in the soft band, as seen in the simulations.
Curves from bottom to top stand for background ratios B equal to 0.1, 0.25, 0.5, 1, 2, 4 and 8. This plot shows how the local
background estimate output of the detection algorithm can be related to the background ratio parameter introduced in the simulations.
The error bars represent the $1\sigma$ standard deviation computed from the source sample. The vertical lines correspond to bounds of our
equal-area off-axis bins. Only the results for 10 ks are displayed and similar relations are extracted for 7, 20, 40 and 80 ks pointings.

Fig. 6. Same as Fig. 5 for various background rates in the soft
band (defined by a multiplicative factor times the values quoted
in the 2nd column of Table 1). Exposure time is held at 10 ks in
all cases. The upper axis gives the flux for a spectral index
$\Gamma = 1.7$ in units of $10^{-14}$ erg s$^{-1}$ cm$^{-2}$.

Using those area curves and the differential distributions of
the sources as a function of their flux, we constructed the log
N-log S relation. Note that it is important to take into account
the flux boosting. This phenomenon especially affects faint objects
with a low detection probability. Owing to Poisson noise,
an object fainter than the flux limit is sometimes detected when
its counts fluctuate upward, while a source brighter than the flux
limit is sometimes missed when its counts fluctuate downward.
This may create an artificial bump in the log N-log S distribution.



Fig. 7. Same as Fig. 5 for the six off-axis bins in the soft band
(see text). Exposure time is held at 10 ks in all cases, and the
particle background ratio is set to 1.



To take this effect into account, we used the numerically simulated
dependencies between the input $CR_{\rm in}$ and the output $CR_{\rm out}$
count rates individually for each pointing (see Fig. 11 for an example).
Clearly, when we detect some flux $CR_{\rm out}$, it corresponds to a
real input $CR_{\rm in}$ distributed over a wide range. For each $CR_{\rm out}$ bin
we constructed the probability density distribution as a function
of $CR_{\rm in}$. Fig. 12 represents the normalized distributions of $CR_{\rm in}$
for three detected $CR_{\rm out}$. At low flux, we may see an asymmetric
shape in the distribution, which is shifted toward smaller $CR_{\rm in}$
because of an artificial flux boosting. Therefore, we randomly
chose some $CR_{\rm in}$ for each detected source with $CR_{\rm out}$ according
to the probability density function. In this way, we carried
out Monte-Carlo simulations with the deconvolution of the output
into the input rates and constructed log N-log S curves for
various considered samples (Figs. 13-14).
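The Monte-Carlo correction for flux boosting described above can be sketched as drawing, for each detected count-rate, an input count-rate from the simulated sources that ended up in the same output bin. This is a minimal illustration under assumptions: the binning, the function names and the fallback for bins without simulated sources are ours, not the authors'.

```python
import bisect
import random


def build_lookup(pairs, bin_edges):
    """Group simulated input rates by the bin of their measured output rate.

    pairs     : iterable of (cr_in, cr_out) from the simulations
    bin_edges : increasing list of output count-rate bin edges
    """
    table = [[] for _ in range(len(bin_edges) - 1)]
    for cr_in, cr_out in pairs:
        k = bisect.bisect_right(bin_edges, cr_out) - 1
        if 0 <= k < len(table):
            table[k].append(cr_in)
    return table


def draw_input_rate(cr_out, table, bin_edges, rng):
    """Draw a plausible input rate for one detected source.

    Sampling uniformly from the stored inputs is equivalent to sampling
    from the empirical distribution of CR_in at the given CR_out.
    """
    k = bisect.bisect_right(bin_edges, cr_out) - 1
    if not 0 <= k < len(table) or not table[k]:
        return cr_out  # assumption: leave uncorrected where simulations are silent
    return rng.choice(table[k])


# Toy simulation output: faint sources are boosted, the bright one is not.
pairs = [(0.0020, 0.0030), (0.0025, 0.0030), (0.0100, 0.0105)]
edges = [0.001, 0.005, 0.02]
table = build_lookup(pairs, edges)
rng = random.Random(1)
faint = draw_input_rate(0.003, table, edges, rng)
bright = draw_input_rate(0.0105, table, edges, rng)
```

Repeating the draw for every detected source and rebuilding the log N-log S curve many times gives both the deboosted curve and an estimate of its scatter.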

The currently estimated log N-log S are lower for both
bands than those of the 2XMM (Ebrero et al. 2009) and
COSMOS (Cappelluti et al. 2007) surveys, with deviations not
exceeding the $2-3\sigma$ Poisson level. However, they agree
excellently with those derived by Gandhi et al. (2006) and
Chiappetti et al. (2005), based on previous releases of XMM-LSS
fields. Moreover, the XMDS (Chiappetti et al. 2005) was
based on a totally different pipeline used for extracting the X-ray
point-like sources. This suggests that the observed deficiency
could be an intrinsic characteristic of the XMM-LSS field.

Fig. 9. Effective area curves for the whole XMM-LSS field in the
soft (0.5-2 keV) and hard (2-10 keV) bands, as a function of flux S
(erg s$^{-1}$ cm$^{-2}$).

Fig. 10. Comparison of the normalized effective area curves in
the soft band for the full exposure XMM-LSS field, the 10
ks version (see subsection 5.1), the XMM-LSS 4.2 sq. deg.
(Gandhi et al. 2006), the XMM Medium Deep Survey (XMDS;
Chiappetti et al. 2005), the 2XMM (Ebrero et al. 2009) and the
COSMOS (Miyaji et al. 2007; Cappelluti et al. 2007).

Fig. 11. Example of the dependence between input and output
count-rates (cts/s) for the hard band, T = 20 ks, B = 1. The red line
corresponds to $CR_{\rm in} = CR_{\rm out}$.



Fig. 12. Distributions of $CR_{\rm in}$ for three detected $CR_{\rm out}$ created on
the basis of the simulated distribution in Fig. 11.



5. The angular correlation function analysis 

To determine the ACF, we generated random catalogs in the 
following way. Firstly, we distributed the fiducial point-like 
sources with random coordinates over the whole investigated 
field. Secondly, we chose for each random source a flux accord- 
ing to the log N-\og S distribution and calculated the probability 
p of detecting the corresponding point-like source in the rele- 
vant pointing, taking into account the exposure time, the particle 
background level (B) of the pointing and the off-axis distance of 
the corresponding source. Then, we chose a random number p 
for each random point-like source that is uniformly distributed 
between and 1. If the p value was less than p, we kept the 
source, if it was higher, we discarded the source. If a random 
source was closer than 10 arcsec to another one, we removed 
it because the extension of the EPIC PSF (~ 6 minimum, on 
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and the Landy & Szalay estimator (lLandv & Szalavl[l993l) : 
DD(ff) - 2DR{0) + RR(6) 



l+w(9)=fa- 



RR(9) 



(3) 



where DD, RR and DR represent the numbers of data-data,
random-random and data-random pairs with a separation $\theta$,
while $f_H$ and $f_{LS}$ are the corresponding normalization factors
of the two estimators. In general the two estimators provide
consistent results, but in any case we present the results based
on both estimators in the correlation function plots.
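As an illustration of the two estimators used in this section (Hamilton; Landy & Szalay) and of the Poisson uncertainty of Eq. (4), the following is a minimal sketch only. It assumes flat-sky positions in arcsec, a brute-force pair counter, and the standard normalized forms of the estimators, in which each pair count is divided by the number of available pairs; the function names and the explicit normalization are assumptions of this sketch and are not taken from the XMM-LSS pipeline.

```python
import math

def count_pairs(a, b, lo, hi, same=False):
    """Brute-force number of pairs with lo <= separation < hi.

    a, b -- lists of (x, y) positions in arcsec on a flat sky.
    same -- True when a and b are the same catalog (unique pairs only).
    """
    n = 0
    for i, (x1, y1) in enumerate(a):
        for x2, y2 in (b[i + 1:] if same else b):
            if lo <= math.hypot(x1 - x2, y1 - y2) < hi:
                n += 1
    return n

def hamilton(dd, dr, rr, n_d, n_r):
    """w = f_H * DD * RR / DR^2 - 1, with the assumed normalization f_H."""
    f_h = 4.0 * n_d * n_r / ((n_d - 1.0) * (n_r - 1.0))
    return f_h * dd * rr / dr ** 2 - 1.0

def landy_szalay(dd, dr, rr, n_d, n_r):
    """w = (DD - 2 DR + RR) / RR using normalized pair counts."""
    dd_n = dd / (0.5 * n_d * (n_d - 1.0))
    dr_n = dr / (n_d * n_r)
    rr_n = rr / (0.5 * n_r * (n_r - 1.0))
    return (dd_n - 2.0 * dr_n + rr_n) / rr_n

def poisson_error(w, dd):
    """sigma_w = (1 + w) / sqrt(DD), as in Eq. (4)."""
    return (1.0 + w) / math.sqrt(dd)
```

For an unclustered sample the normalized DD, DR and RR counts coincide and both functions return zero, which is a convenient sanity check before applying them to real pair counts.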

To speed up our calculations, we divided the random catalog
of those samples with more than 2000 X-ray sources into a maximum
of 10 random subcatalogs, and we averaged $w(\theta)$ for each
$\theta$ bin over all the random subcatalogs. We verified, by
investigating one such sample, that the above procedure provides
stable correlation results. The $w(\theta)$ uncertainty in each
$\theta$-bin is given by



$$\sigma_w = (1 + w)/\sqrt{DD}. \tag{4}$$

Fig. 13. Log N-log S distributions in the soft band for the
whole XMM-LSS sample and for the two different procedures
of handling the pointing overlaps. The results of the XMM
Medium Deep Survey (XMDS) (Chiappetti et al. 2005), 2XMM
(Ebrero et al. 2009), XMM-LSS 4.2 sq. deg. (Gandhi et al. 2006)
and COSMOS (Miyaji et al. 2007; Cappelluti et al. 2007) are shown
for comparison. The vertical bars denote 1$\sigma$ uncertainties.






Fig. 14. Log N-log S distributions in the hard band for the
whole sample and for the two different procedures of handling
the pointing overlaps. For comparison we present the log N-log S
distributions for the same samples as in Fig. 13.



axis) prevents one from detecting such close pairs and blends
them into a single source. Each random catalog generated in this
way contains 100 times the number of point-like sources in
the real source catalog used in the present analysis.
The larger the random catalog, the more accurate the ACF
measurement, because random fluctuations caused by small
numbers are suppressed.
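The random-catalog procedure described in this section can be sketched as an accept-reject loop. This is a minimal sketch under stated assumptions, not the pipeline used for the survey: `detect_prob` is a hypothetical callable standing in for the detection probability derived from the exposure time, background and off-axis distance; `draw_flux` uses a toy power-law log N-log S with invented parameters; separations use a flat-sky approximation; and a candidate closer than 10 arcsec to an already kept source is simply rejected.

```python
import math
import random

def sep_arcsec(ra1, dec1, ra2, dec2):
    """Small-angle (flat-sky) separation in arcsec; inputs in degrees."""
    dx = (ra1 - ra2) * math.cos(math.radians(0.5 * (dec1 + dec2)))
    return 3600.0 * math.hypot(dx, dec1 - dec2)

def draw_flux(rng, s_min=1e-15, alpha=1.5):
    """Flux from a toy power law N(>S) ~ S^-alpha (hypothetical values)."""
    return s_min * (1.0 - rng.random()) ** (-1.0 / alpha)

def make_random_catalog(n_target, field, detect_prob, rng, min_sep=10.0):
    """Accept-reject sampling of random sources.

    field       -- (ra_min, ra_max, dec_min, dec_max) in degrees
    detect_prob -- hypothetical callable p(ra, dec, flux) in [0, 1]
    """
    ra_min, ra_max, dec_min, dec_max = field
    kept = []
    while len(kept) < n_target:
        ra = rng.uniform(ra_min, ra_max)
        dec = rng.uniform(dec_min, dec_max)
        flux = draw_flux(rng)
        p = detect_prob(ra, dec, flux)
        rho = rng.random()          # uniform random number in [0, 1)
        if rho >= p:
            continue                # not detected: discard the source
        if any(sep_arcsec(ra, dec, r, d) < min_sep for r, d, _ in kept):
            continue                # would blend with a kept source
        kept.append((ra, dec, flux))
    return kept
```

The brute-force proximity check scales quadratically with the catalog size, so a catalog 100 times larger than the data sample would in practice call for a spatial index; the loop above only illustrates the logic.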

To calculate the ACF, we used two estimators, the Hamilton
estimator (Hamilton 1993), as in Gandhi et al. (2006):



$$1 + w(\theta) = f_H\,\frac{DD(\theta)\,RR(\theta)}{DR^2(\theta)} \tag{2}$$



The ACF calculations were performed for angular scales in
the range 20" < $\theta$ < $\theta_{max}$, where $\theta_{max}$ = 12000". We used 20"
as our lowest angular-separation limit because of the large size
of the XMM-Newton PSF near the FoV borders. We verified that
pairs constituted by sources belonging to adjacent pointings are
real only for pair separations > 20". We then fitted the resulting
ACF with a power law in the angular range where it was possible,
i.e., using only the positive $w$ values:



$$w(\theta) = (\theta_0/\theta)^{\gamma-1} \tag{5}$$
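A power-law fit of the form of Eq. (5) can be sketched as a weighted linear regression in log space, since $\ln w = (\gamma - 1)(\ln\theta_0 - \ln\theta)$. This is only an illustrative assumption about the fitting method (the text does not specify the minimization that was used); the function name and the log-space weights $(w/\sigma_w)^2$ are choices made for this sketch. As in the text, only bins with positive $w$ enter the fit, and the slope can optionally be fixed, e.g. to $\gamma = 1.8$.

```python
import math

def fit_power_law(theta, w, sigma, gamma_fixed=None):
    """Weighted least squares for ln w = (gamma - 1) * (ln theta0 - ln theta).

    Only bins with w > 0 are used. Returns (theta0, gamma), with theta0
    in the same units as theta.
    """
    rows = [(math.log(t), math.log(wi), (wi / si) ** 2)
            for t, wi, si in zip(theta, w, sigma) if wi > 0]
    sw = sum(k for _, _, k in rows)
    mx = sum(k * x for x, _, k in rows) / sw
    my = sum(k * y for _, y, k in rows) / sw
    if gamma_fixed is None:
        sxx = sum(k * (x - mx) ** 2 for x, _, k in rows)
        sxy = sum(k * (x - mx) * (y - my) for x, y, k in rows)
        slope = sxy / sxx
    else:
        slope = -(gamma_fixed - 1.0)   # slope of ln w versus ln theta
    intercept = my - slope * mx
    gamma = 1.0 - slope
    theta0 = math.exp(intercept / (gamma - 1.0))
    return theta0, gamma
```

A regression in log space is biased for noisy bins with $w$ close to zero, so a direct $\chi^2$ minimization in linear space may be preferable for real data.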



We analyze here the ACF of the full exposure XMM-LSS
field. However, because there are strong indications for a
flux-limit dependence of the correlation function amplitude (e.g.,
Plionis et al. 2008), we also analyzed a homogeneous sample
of an effective 10 ks exposure over the whole XMM-LSS region.
To this end we cut the event list of the pointings into 10 ks
chunks and repeated the source detection procedure from the
beginning. We also separately estimated the ACF of samples based
on the hardness ratio (HR).

5.1. The whole XMM-LSS field 

We first present in Figs. 15 and 16 the ACF results of the full
exposure XMM-LSS region for both the soft and hard bands and
for the Voronoi delimitation and off-axis angle < 10' overlap
approaches. In the inset panels we present the 1, 2 and 3$\sigma$ contours
of the fitted parameters in the ($\theta_0$, $\gamma$) plane, while in Table 3 we
present the corresponding best-fit $\theta_0$ and $\gamma$ parameters and their
standard deviations, as well as the value of $\theta_0$ for a fixed slope
$\gamma = 1.8$ and the integral ACF signal within separations of 3.3
arcmin, $w(< 3.3')$. Evidently, there are no significant differences
between the results based on the two correlation function
estimators (as seen in Figs. 15-16) or on the two delimitation
methods. Therefore, for the remaining study we used only the
samples based on the space-filling Voronoi delimitation method
and the Landy & Szalay ACF estimator (see also Kerscher et al.
(2000) for a detailed comparison of different estimators).

Furthermore, we find that the hard band correlation function
is slightly but clearly stronger than the corresponding
soft band one, as can also be verified by comparing the
corresponding inset contour plots. This agrees with the results of
Basilakos et al. (2005) and Puccetti et al. (2006) but disagrees with
those of Ebrero et al. (2009).

Fig. 15. Soft band ACF for the whole sample: Voronoi delimitation
(top panel); off-axis angle < 10' (lower panel). The filled
(black) points correspond to the Hamilton estimator while the
open (red) squares correspond to the Landy & Szalay estimator. The
error bars represent 1$\sigma$ standard deviation. The dashed line represents
the best power-law fit, while the continuous line corresponds to
the constant $\gamma = 1.8$ fit. The inset plot presents the 1, 2 and 3$\sigma$
contours in the fitted ($\theta_0$, $\gamma$) parameter space.



Fig. 16. Hard band ACF for the whole sample: Voronoi delimitation
(upper panel); off-axis angle < 10' (lower panel).

Table 4. Correlation function for the 10 ks chunk samples.

Band  N     $\theta_0$ (")   $\gamma$        $\theta_{0,\gamma=1.8}$ (")   $w(< 3.3')$
Soft  4360  3.2 ± 0.5    1.93 ± 0.03   1.3 ± 0.2       0.005 ± 0.007
Hard  1712  9.9 ± 1.4    1.98 ± 0.04   3.8 ± 0.7       0.092 ± 0.019



Table 3. The soft and hard band correlation functions for the
whole XMM-LSS field, as well as for the two possible overlap
approaches. N indicates the number of X-ray sources in the
corresponding sample, while the last column shows the integrated
ACF signal, and its uncertainty, within 20" < $\theta$ < 200".

Band  Overlap  N     $\theta_0$ (")   $\gamma$        $\theta_{0,\gamma=1.8}$ (")   $w(< 3.3')$
Soft  Vor.     5093  1.3 ± 0.2    1.94 ± 0.02   1.3 ± 0.2       0.006 ± 0.007
      < 10'    4066  1.4 ± 0.3    1.81 ± 0.02   1.3 ± 0.2       0.009 ± 0.003
Hard  Vor.     2369  7.5 ± 0.9    2.00 ± 0.03   2.5 ± 0.4       0.075 ± 0.013
      < 10'    1988  6.5 ± 0.8    1.91 ± 0.03   3.7 ± 0.5       0.080 ± 0.014



As discussed above, to provide a "clean" ACF that is unaffected
by the convolution of (a) the variable flux-limit in the
different parts of the survey and (b) the flux-limit clustering
dependence, we considered a sample with a homogeneous 10 ks
exposure time across the whole XMM-LSS area. Table 4 and
Fig. 17 show the parameters of the ACFs for both bands. The
main variation with respect to the previous analysis is that the
ACF difference between the soft and hard bands is now even
more prominent.



We also investigated the flux-limit dependence of clustering
with our homogeneous 10 ks sample. To this end we estimated
the angular clustering length, $\theta_0$, for various flux-limited
subsamples by keeping the slope of the ACF fixed to its nominal
value of $\gamma = 1.8$. Fig. 18 shows the corresponding results for the
soft and hard bands. The known dependence is clearly
reproduced with our data, and it will be interesting to investigate
whether this dependence is present in the spatial correlation
length, via Limber's inversion (see further below). Another
interesting result is that the amplitude of the hard band ACF is
larger than that of the soft band only at the lowest flux limits. At
flux limits > $10^{-14}$ erg s$^{-1}$ cm$^{-2}$ the trend is reversed and the soft
band clustering is stronger than the hard band clustering.

How do our results compare with those of other XMM surveys?
With respect to our previous release of the 4.2 sq. deg.
XMM-LSS survey (Gandhi et al. 2006), our new catalog introduces
many improvements. Among them are the wider (by ~ 2.6
times) sampled area and the inclusion of a deeper SXDS field.
Furthermore, we updated the point-like source detection procedure
and introduced a novel definition of the selection function
and random-catalog generation procedure.

Fig. 17. ACF for the 10 ks sample in the soft band (upper panel)
and for the hard band (lower panel).

Fig. 18. Best-fit correlation length $\theta_0$ for $\gamma = 1.8$ as a function of
the flux limit of the homogeneous 10 ks sample in the soft (filled
circles) and the hard bands (open circles).



Our current XMM-LSS area curve is substantially different
from that of Gandhi et al. (2006) (see Fig. 10). To investigate
the reasons for this difference in detail, we used the 44 pointings
common to both studies to compare the corresponding point-like
source catalogs in the soft band. The current XMM-LSS catalog
contains 2106 objects with off-axis distances less than 10' and
the Gandhi et al. (2006) catalogue contains 1093 such sources,
while the common sources are 1048. Fig. 19 shows the dependence
between flux and ML for both catalogs. Obviously, the
chosen ML limit of the Gandhi et al. (2006) catalog is substantially
higher than the current limit of ML = 15 and it is equal to






Fig. 19. Dependence between flux and ML in the soft band for
the point-like sources of 44 pointings from the present XMM-LSS
survey (black open circles) and from Gandhi et al. (2006)
(red filled circles).



ML ~ 40. It is also evident, inspecting Fig. 19, that a value of
ML ~ 40 is associated with a significantly higher flux-limit with
respect to that of ML = 15, causing the observed difference of
the corresponding area curves (see Fig. 10).

The above mentioned changes and improvements, particularly
the lower ML detection limit, have resulted in a variation of
some of our results with respect to those of Gandhi et al. (2006).
Specifically, we found a slightly different clustering signal in the
soft band: $\theta_0$ = 1.3" ± 0.2" for $\gamma = 1.81$ vs 6.3" ± 3" for $\gamma = 2.2$
in Gandhi et al. (2006). However, at the fixed canonical value of
the exponent ($\gamma = 1.8$), the Gandhi et al. (2006) soft band analysis
provides a clustering amplitude of $\theta_0$ = 1.7" ± 0.9" versus
1.2" ± 0.2" for the current XMM-LSS survey. The lower correlation
signal of our current XMM-LSS survey should be attributed
to the lower ML limit, which introduces a significantly higher
fraction of faint sources with respect to the higher ML limit of
the Gandhi et al. (2006) catalog. Also, we found a significant
clustering signal in the hard band, in contrast to the absence of
any significance in Gandhi et al. (2006).

With respect to the COSMOS (Miyaji et al. 2007) and
2XMM (Ebrero et al. 2009) surveys, we find (at fixed canonical
$\gamma = 1.8$) a lower soft band correlation function amplitude,
$\theta_0$ = 1.2" ± 0.2", compared to 1.9" ± 0.3" and to 7.7" ± 0.1"
for the COSMOS and the 2XMM surveys, respectively. Our hard
band (2-10 keV) XMM-LSS correlation amplitude of 3.6" ± 0.7"
is also lower than the corresponding 2XMM value of 5.9" ± 0.3",
while the COSMOS hard band correlation results are not very
significant, probably because they are divided into two
sub-bands (2-4.5 and 4.5-10 keV).

Note, however, that the wide contiguous area of the XMM-LSS
survey implies that we should have a better estimation of
$w(\theta)$ on large angular scales (i.e., 1000" < $\theta$ < 10000"), while
COSMOS and 2XMM are limited to ~ 6000" and ~ 1000",
respectively.

In Fig. 20 we compare the soft band $w(\theta)$ of our XMM-LSS
and the 2XMM surveys. The large 2XMM $w(\theta)$ amplitude at
small angular scales is evident, although at ~ 1000" the two
correlation functions appear to be consistent. The higher 2XMM
correlation amplitude should be attributed to the considerably
different mix of faint and bright sources in the two surveys, as
shown by their respective area curves (see Fig. 10). The larger
fraction of faint sources in the current XMM-LSS survey causes
the lower amplitude of the source angular correlation function
with respect to the 2XMM, as expected from the known
dependence between clustering and flux-limit (Plionis et al. 2008;
Ebrero et al. 2009), a fact which has also been verified by our
analysis (Fig. 18).

Fig. 20. Soft band ACF comparison between our XMM-LSS survey
(filled points) and that of the 2XMM survey (crosses) of
Ebrero et al. (2009).

5.2. Subsamples of sources with soft and hard spectra 

An interesting question, that relates to the unification paradigm 
of AGN, is whether the clustering pattern, among others, of hard 
and soft-spectrum AGN is comparable. According to the unifi- 
cation paradigm, what determines the appearance of an AGN as 
obscured or unobscured (type II or I) is its orientation with re- 
spect to the observer's line-of-sight. Therefore, there should be 
no intrinsic difference in their clustering pattern. On this ques- 
tion there have been conflicting results in the literature and we 
here re-address this with our data. 

To this end we compared the correlation function of the hard
and soft-spectrum sources by separating them, within each band,
using the hardness ratio, HR, indicator defined as

$$HR = \frac{CR_h - CR_s}{CR_h + CR_s}, \tag{6}$$

where $CR_s$ and $CR_h$ represent the total count rates in the
soft and the hard band, respectively. It is known that most of
the sources with HR > -0.2 are likely to be obscured
(hard-spectrum) AGN; conversely, the sources with HR < -0.2 are
mostly (soft-spectrum) unobscured (see Gandhi et al. (2004) for
details). Using this criterion, we split the whole sample and
derived the log N-log S distributions for each of them in the soft
and the hard bands (Fig. 21). Table 5 and Figs. 22-23 show the
parameters and ACFs for the obtained subsamples.
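The hardness-ratio split of Eq. (6) is simple enough to sketch directly. The dictionary keys `cr_soft` and `cr_hard` are hypothetical field names chosen for this illustration, and sources with HR exactly equal to the threshold are placed in the soft-spectrum group, which is an arbitrary convention of the sketch.

```python
def hardness_ratio(cr_soft, cr_hard):
    """HR = (CR_h - CR_s) / (CR_h + CR_s), as in Eq. (6)."""
    return (cr_hard - cr_soft) / (cr_hard + cr_soft)

def split_by_hr(sources, threshold=-0.2):
    """Split sources into (hard-spectrum, soft-spectrum) lists.

    sources -- list of dicts with hypothetical keys 'cr_soft', 'cr_hard'
               holding the count rates in the two bands.
    """
    hard, soft = [], []
    for src in sources:
        hr = hardness_ratio(src["cr_soft"], src["cr_hard"])
        (hard if hr > threshold else soft).append(src)
    return hard, soft
```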

The main result of this analysis is that there is a distinct
clustering difference between the sources with hard and soft spectra
in the soft band, with the former sources being significantly more
clustered. In the hard band the corresponding comparison shows
a much weaker difference, in the same direction, but not that
significant. However, one also observes that the integrated signal
within separations $\lesssim$ 3 arcmin indicates that at least on these
small scales the hard-spectrum sources show a stronger clustering
signal than the corresponding soft-spectrum ones. Similar
results were found in Gandhi et al. (2006). Therefore, one may
conclude that there are indeed indications for a different clustering
pattern between hard-spectrum and soft-spectrum sources,
which cannot be attributed to their different flux-limits, since we
verified that this result is valid for brighter flux-limits as well. We
believe that this result suggests a possible environmental component
in the determination of the different types of AGN, beyond
their orientation with respect to the observer's line-of-sight. An
environmental dependence of the AGN type has also been found
in local optical AGN samples (e.g., Koulouridis et al. 2006a,b,
2009 and references therein).

Fig. 21. Log N-log S distributions in the soft and the hard bands
for sources with different hardness ratios.

Fig. 22. ACF for the whole XMM-LSS sample in the soft band
for sources with HR > -0.2 (filled circles, hard-spectrum AGN),
and for sources with HR < -0.2 (open circles; soft-spectrum
AGN). Note that for clarity we do not plot the $w(\theta)$
uncertainties of the latter sources. The solid line represents the
$\gamma = 1.8$ fit to the HR > -0.2 $w(\theta)$, while the dashed line
corresponds to the HR < -0.2 $w(\theta)$ fit.

Fig. 23. As in Fig. 22 but for the hard band.

Table 5. Correlation function for the subsamples characterized
by their hardness ratio above and below -0.2.

Band  HR      N     $\theta_0$ (")    $\gamma$        $\theta_{0,\gamma=1.8}$ (")   $w(< 3.3')$
Soft  > -0.2  674   10.3 ± 3.3    1.93 ± 0.08   5.2 ± 2.0       0.066 ± 0.048
      < -0.2  4418  1.5 ± 0.2     1.80 ± 0.02   1.5 ± 0.2       0.019 ± 0.005
Hard  > -0.2  1170  10.7 ± 1.7    1.94 ± 0.04   5.5 ± 1.0       0.129 ± 0.028
      < -0.2  1198  13.1 ± 2.4    2.04 ± 0.06   4.4 ± 1.1       0.056 ± 0.026

6. Inverting from angular to spatial clustering 

We now derive the spatial correlation length that corresponds to
the measured angular clustering. To this end we used the usual
Limber inversion (Peebles 1980). The main steps are sketched
below.

In a spatially flat universe, the ACF $w(\theta)$ can be obtained
from the spatial one, $\xi(r)$, by



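$$w(\theta) = 2\,\frac{\int_0^\infty \int_0^\infty x^4 \phi^2(x)\,\xi(r,z)\,dx\,du}{\left[\int_0^\infty x^2 \phi(x)\,dx\right]^2}, \tag{7}$$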



where the physical separation between any two sources that are
separated by an angle $\theta$, considering the small angle
approximation, is given by

$$r \simeq \frac{1}{(1+z)}\left(u^2 + x^2\theta^2\right)^{1/2}, \tag{8}$$

while $\phi(x)$ is the selection function (the probability that a source
at a distance $x$ is detected in the survey) given by

$$\phi(x) = \int_{L_{min}(z)}^{\infty} \Phi(L_x, z)\,dL, \tag{9}$$

where $\Phi(L_x, z)$ is the redshift-dependent luminosity function of
the X-ray selected AGN. A variety of X-ray source luminosity
functions are available in the literature, and to investigate the
uncertainty that their differences can introduce in the derived value
of $r_0$, we present results for a number of $\Phi(L_x, z)$. Although
the most recent soft/hard band luminosity functions are those
of Ebrero et al. (2009), we also use those of Hasinger et al.
(2005) for the soft band, while for the hard band we used those
of Ueda et al. (2008) and of La Franca et al. (2005). In all cases
we used the luminosity-dependent density evolution
model of the luminosity function.

The proper distance $x(z)$ is related to the redshift through

$$x(z) = \frac{c}{H_0}\int_0^z \frac{dy}{E(y)}, \tag{10}$$

$$E(z) = \left[\Omega_m (1+z)^3 + \Omega_\Lambda\right]^{1/2}, \qquad \Omega_m = 1 - \Omega_\Lambda. \tag{11}$$
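Equations (10) and (11) have no closed form for $\Omega_\Lambda \neq 0$, so $x(z)$ is usually evaluated numerically. The following minimal sketch uses Simpson's rule and expresses distances in $h^{-1}$ Mpc through $c/H_0 = 2997.92458\,h^{-1}$ Mpc; the function names, the default $\Omega_m = 0.3$ and the number of integration steps are assumptions of the sketch.

```python
import math

C_OVER_H0 = 2997.92458  # c / H0 in h^-1 Mpc, for H0 = 100 h km/s/Mpc

def e_func(z, omega_m=0.3):
    """E(z) of Eq. (11) for a flat universe (Omega_Lambda = 1 - Omega_m)."""
    return math.sqrt(omega_m * (1.0 + z) ** 3 + (1.0 - omega_m))

def distance(z, omega_m=0.3, n=200):
    """x(z) = (c/H0) * int_0^z dy / E(y), Eq. (10), by Simpson's rule.

    n is the (even) number of integration intervals.
    """
    if z == 0.0:
        return 0.0
    h = z / n
    s = 1.0 / e_func(0.0, omega_m) + 1.0 / e_func(z, omega_m)
    for i in range(1, n):
        s += (4.0 if i % 2 else 2.0) / e_func(i * h, omega_m)
    return C_OVER_H0 * s * h / 3.0
```

For $\Omega_m = 1$ the integral is analytic, $x = 2(c/H_0)[1 - (1+z)^{-1/2}]$, which gives a quick check of the numerical routine.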

In this context, the spatial correlation function can be modeled
as in de Zotti et al. (1990):

$$\xi(r, z) = (r/r_0)^{-\gamma}\,(1+z)^{-(3+\epsilon)}, \tag{12}$$

where $r_0$ is the correlation length in three dimensions and $\epsilon$
parameterizes the type of clustering evolution. A value of
$\epsilon = \gamma - 3$ (i.e., $\epsilon = -1.2$ for $\gamma = 1.8$) indicates a constant
clustering in comoving coordinates, while $\epsilon = -3$ indicates a constant
clustering in physical coordinates (e.g., de Zotti et al. 1990).

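Combining the above system of equations, we obtained the
following integral equation for $w(\theta)$:

$$w(\theta) = 2\,\frac{H_0}{c}\int_0^\infty \left(\frac{1}{N}\frac{dN}{dz}\right)^2 E(z)\,dz \int_0^\infty \xi(r,z)\,du, \tag{13}$$

where $dN/dz$ denotes the number of objects in the given survey
within a solid angle $\Omega_s$ and in the shell $(z, z + dz)$. It takes the
following form:

$$\frac{dN}{dz} = \Omega_s\,x^2\,\phi(x)\,\frac{c}{H_0}\,E^{-1}(z). \tag{14}$$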



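Using the power-law form of the ACF (Eq. 5), the model of Eq. (12)
and the integral equation (13), we find that the amplitude
$\theta_0$ in two dimensions is related to the correlation length $r_0$
in three dimensions through the equation (see Basilakos et al.
2005):

$$\theta_0^{\gamma-1} = H_\gamma\,r_0^{\gamma}\,\frac{H_0}{c}\int_0^\infty \left(\frac{1}{N}\frac{dN}{dz}\right)^2 \frac{E(z)}{x^{\gamma-1}(z)}\,(1+z)^{-3-\epsilon+\gamma}\,dz, \tag{15}$$

where $H_\gamma = \Gamma(\frac{1}{2})\,\Gamma(\frac{\gamma-1}{2})/\Gamma(\frac{\gamma}{2})$.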
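To make the inversion of Eq. (15) concrete, the sketch below solves it for $r_0$ given $\theta_0$, a tabulated redshift distribution and tabulated distances. It is a hedged illustration only: the redshift grid, the $dN/dz$ values and the distances are inputs that the caller must supply (in the paper they follow from the luminosity function and the flux limit), the trapezoidal integration is a simplification, and all function names are hypothetical.

```python
import math

C_OVER_H0 = 2997.92458  # c / H0 in h^-1 Mpc

def h_gamma(gamma):
    """H_gamma = Gamma(1/2) Gamma((gamma-1)/2) / Gamma(gamma/2)."""
    return (math.gamma(0.5) * math.gamma(0.5 * (gamma - 1.0))
            / math.gamma(0.5 * gamma))

def e_func(z, omega_m=0.3):
    return math.sqrt(omega_m * (1.0 + z) ** 3 + (1.0 - omega_m))

def r0_from_theta0(theta0_arcsec, z, dndz, x, gamma=1.8, eps=-1.2,
                   omega_m=0.3):
    """Solve Eq. (15) for r0 in h^-1 Mpc.

    z, dndz, x -- equal-length lists: redshift grid (z > 0), unnormalized
                  dN/dz, and distance x(z) in h^-1 Mpc.
    """
    def trapz(f):
        return sum(0.5 * (f[i] + f[i + 1]) * (z[i + 1] - z[i])
                   for i in range(len(z) - 1))
    n_tot = trapz(dndz)                      # N, so that (1/N) dN/dz integrates to 1
    integrand = [(n / n_tot) ** 2 * e_func(zi, omega_m)
                 * xi ** (1.0 - gamma) * (1.0 + zi) ** (gamma - 3.0 - eps)
                 for zi, n, xi in zip(z, dndz, x)]
    integral = trapz(integrand) / C_OVER_H0  # includes the H0/c factor
    theta0 = math.radians(theta0_arcsec / 3600.0)
    return (theta0 ** (gamma - 1.0) / (h_gamma(gamma) * integral)) ** (1.0 / gamma)
```

Because $\theta_0^{\gamma-1} \propto r_0^{\gamma}$ at fixed redshift distribution, doubling $\theta_0$ raises $r_0$ by a factor $2^{(\gamma-1)/\gamma}$, about 1.36 for $\gamma = 1.8$.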

Following the previous steps, we derived the spatial clustering
length scale for fixed $\gamma = 1.8$ and for both values of the
clustering evolution parameter ($\epsilon = -1.2$ and $-3$). The results are
presented in Table 6. Evidently, all three hard band luminosity
functions provide the same $r_0$ value, while for $\epsilon = -1.2$ there
is a difference in the soft band, with the Hasinger et al. (2005)
$\Phi_x(L)$ providing an $r_0$ value that is 16% higher than that
provided by the Ebrero et al. (2009) $\Phi_x(L)$. As we will see, this
difference increases proportionally to the flux-limit of the subsample
used. In Fig. 24 we present the inverted $r_0$ values as a function of



Table 6. Spatial correlation length $r_0$ (in $h^{-1}$ Mpc), provided by
Limber's inversion of the ACF and using different AGN X-ray
luminosity functions, for the homogeneous 10 ks sample and for
the lowest flux-limit available. Note that the corresponding soft
and hard band median redshifts are $z \simeq 1.1$ and $\simeq 1$, respectively,
while the peaks of the corresponding redshift distributions are at
$z \simeq 1$ and 0.7, respectively.

        Soft band                 Hard band
$\epsilon$     Ebrero     Hasinger     Ebrero      La Franca   Ueda
-1.2    6.2 ± 0.7  7.2 ± 0.8    10.1 ± 0.9  9.8 ± 0.9   10.1 ± 0.9
-3      3.2 ± 0.4  3.3 ± 0.4    5.3 ± 0.5   5.2 ± 0.5   5.3 ± 0.5



the different flux limits, as they appear in Fig. 18. We see that for
the soft band the two luminosity functions used in the inversion
provide $r_0$ values that diverge with increasing flux-limit.

Fig. 24. Spatial correlation length $r_0$ for $\gamma = 1.8$ and $\epsilon = -1.2$
considering the homogeneous 10 ks based observations as a
function of the flux limit of the sample in the soft band (filled and
empty circles correspond to the Ebrero and Hasinger $\Phi_x(L)$,
respectively) and in the hard band (filled squares, open squares and
open hexagons correspond to the Ebrero, Ueda and La Franca
$\Phi_x(L)$, respectively).



The dashed lines in Fig. 24 correspond to fits of the data, using
for each band results based on all different luminosity functions,
of the form:

$$r_0 = A\left(\frac{f_x}{3 \times 10^{-15}}\right)^{\beta}, \tag{16}$$

with $(A, \beta) \simeq (6.5, 0.54)$ for the soft band and $(A, \beta) \simeq (9.4, 0.1)$
for the hard band. Evidently, the flux dependence of clustering,
once one inverts from angular to 3D space, is preserved mostly in
the soft band. In the hard band we see at most a weak dependence
and only for fluxes ~ $2 \times 10^{-14}$ erg s$^{-1}$ cm$^{-2}$, while a constant
hard band value of $r_0 \simeq 10\,h^{-1}$ Mpc, irrespective of the flux
limit, also appears to be consistent with the data. These hard
band results agree with those of Ebrero et al. (2009), who found
that the weak dependence of $\theta_0$ on the flux-limit translates into
a roughly constant $r_0$ as a function of flux-limit, or equivalently
as a function of median redshift or median X-ray luminosity of
the sample. However, the relatively strong dependence of the soft
band $r_0$ on the flux-limit disagrees with Ebrero et al.
(2009), but agrees with Plionis et al. (2008).
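As a worked illustration of these fits, the sketch below evaluates a power law in flux, $r_0 = A\,(f_x / 3 \times 10^{-15})^{\beta}$, with the quoted $(A, \beta)$ pairs. The power-law form and the pivot flux of $3 \times 10^{-15}$ erg s$^{-1}$ cm$^{-2}$ are assumptions of the sketch; the function name is hypothetical.

```python
def r0_of_flux(f_x, amplitude, beta, f_ref=3e-15):
    """r0 (h^-1 Mpc) at flux limit f_x (erg/s/cm^2) for a power-law fit."""
    return amplitude * (f_x / f_ref) ** beta

# Quoted fit parameters: soft band (6.5, 0.54), hard band (9.4, 0.1).
soft_at_1e14 = r0_of_flux(1e-14, 6.5, 0.54)   # about 12.5 h^-1 Mpc
hard_at_1e14 = r0_of_flux(1e-14, 9.4, 0.1)    # about 10.6 h^-1 Mpc
```

Under these assumptions the soft band fit rises from 6.5 to roughly 12.5 $h^{-1}$ Mpc between flux limits of $3 \times 10^{-15}$ and $10^{-14}$ erg s$^{-1}$ cm$^{-2}$, while the hard band fit changes by only about 13%.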



7. Bias of the X-ray selected AGN 

The concept of biasing between different classes of extragalactic
objects and the background matter distribution was introduced
by Kaiser (1984) and Bardeen et al. (1986) to explain the higher
amplitude of the two-point correlation function of clusters of
galaxies with respect to that of galaxies themselves. In our case
and within the framework of linear biasing (cf. Kaiser 1984;
Benson et al. 2000), the evolution of the bias parameter is
usually defined as

$$b(z) = \left[\frac{\xi_{AGN}(8,z)}{\xi_{DM}(8,z)}\right]^{1/2}, \tag{17}$$



where $\xi_{AGN}(8,z) = (r_0(z)/8)^{\gamma}$ and $\xi_{DM}(8,z)$ are the spatial
correlation functions of AGN and dark matter halos evaluated at 8
$h^{-1}$ Mpc, respectively. Notice that the correlation lengths in 3D
are presented in Table 6. The correlation function of the DM halos
is given by (Peebles 1980)

$$\xi_{DM}(8,z) = \frac{\sigma_8^2(z)}{J_2}, \tag{18}$$

where $J_2 = 72/[(3-\gamma)(4-\gamma)(6-\gamma)2^{\gamma}]$ and $\sigma_8^2(z)$ is the dark
matter density variance in a sphere with a comoving radius of 8
$h^{-1}$ Mpc, which evolves as

$$\sigma_8(z) = \sigma_8\,D(z)/D(0). \tag{19}$$



Note that $D(z)$ is the linear growth factor scaled to unity at the
present time. For the concordance $\Lambda$ cosmology, the growth
factor becomes (see Peebles 1993)

$$D(z) = \frac{5\,\Omega_m E(z)}{2}\int_z^\infty \frac{(1+y)}{E^3(y)}\,dy. \tag{20}$$



Finally, inserting Eq. (19) and Eq. (18) into Eq. (17), we obtain the
evolution of biasing with epoch as a function of the clustering
properties

$$b(z) = \left[\frac{r_0(z)}{8}\right]^{\gamma/2} J_2^{1/2}\left[\frac{\sigma_8 D(z)}{D(0)}\right]^{-1}. \tag{21}$$
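The chain from Eq. (17) to Eq. (21) can be evaluated numerically as in the sketch below. It assumes $\Omega_m = 0.3$ and $\sigma_8 = 0.80$, uses the standard Peebles coefficient $J_2 = 72/[(3-\gamma)(4-\gamma)(6-\gamma)2^{\gamma}]$, and integrates the growth factor of Eq. (20) after the substitution $a = 1/(1+y)$; the function names and the integration scheme are choices made for this illustration. The value obtained depends on the redshift adopted for the sample, so it need not reproduce the tabulated bias factors exactly.

```python
import math

def e_func(z, omega_m=0.3):
    return math.sqrt(omega_m * (1.0 + z) ** 3 + (1.0 - omega_m))

def growth(z, omega_m=0.3, n=2000):
    """D(z) = (5 Om E(z) / 2) * int_z^inf (1 + y) / E(y)^3 dy, Eq. (20).

    With a = 1 / (1 + y) the integral becomes int_0^{a_z} da / (a E)^3,
    evaluated here with Simpson's rule (n even).
    """
    def f(a):
        if a == 0.0:
            return 0.0  # the integrand vanishes as a^(3/2) for a -> 0
        return (a * e_func(1.0 / a - 1.0, omega_m)) ** -3
    a_z = 1.0 / (1.0 + z)
    h = a_z / n
    s = f(0.0) + f(a_z) + sum((4.0 if i % 2 else 2.0) * f(i * h)
                              for i in range(1, n))
    return 2.5 * omega_m * e_func(z, omega_m) * s * h / 3.0

def j2(gamma):
    return 72.0 / ((3.0 - gamma) * (4.0 - gamma) * (6.0 - gamma) * 2.0 ** gamma)

def bias(r0, z, gamma=1.8, sigma8=0.80, omega_m=0.3):
    """Linear bias of Eq. (21) for a correlation length r0 in h^-1 Mpc."""
    sigma8_z = sigma8 * growth(z, omega_m) / growth(0.0, omega_m)
    return (r0 / 8.0) ** (gamma / 2.0) * math.sqrt(j2(gamma)) / sigma8_z
```

For example, `bias(10.1, 1.0)` evaluates to roughly 3.4 under these assumptions, and for $\Omega_m = 1$ the routine recovers the Einstein-de Sitter result $D(z) = 1/(1+z)$.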



For angular clustering we may identify the dominant redshift 
of the sample under study as that predicted by the luminosity 
function of the sources used and the flux limit of the sample, 
which predicts the redshift distribution of the sources. We can 
then obtain from the last equation an estimate of the bias of our 
X-ray sources (see Table 7). Of course one has to keep in mind
that this is a quite crude estimate since we implicitly assume that 
all detected sources obey the same luminosity function, while in 
effect luminosity functions are derived from subsamples of all 
the detected X-ray sources for which optical counterparts are 
identified. 

Table 7. Linear bias factor for the lowest flux-limit results of the
homogeneous 10 ks XMM-LSS data (and for the same X-ray
luminosity functions as in Table 6).

        Soft band                 Hard band
$\epsilon$     Ebrero     Hasinger     Ebrero     La Franca   Ueda
-1.2    2.2 ± 0.2  2.7 ± 0.3    3.3 ± 0.3  3.2 ± 0.3   3.3 ± 0.3
-3      1.2 ± 0.1  1.3 ± 0.1    1.9 ± 0.2  1.8 ± 0.2   1.9 ± 0.2



We see again that although our hard band results roughly
agree with those of Ebrero et al. (2009), our soft band results are
significantly different, because we found a significantly weaker
clustering amplitude than the aforementioned authors.

We can now use a bias evolution model (e.g., Sheth et al.
2001; Basilakos, Plionis & Ragone-Figueroa 2008, and references
therein) to estimate the halo mass that corresponds to the
above estimated bias factors (for $\epsilon = -1.2$), assuming that each
halo hosts one AGN source. Using the latter model (see details in
Papageorgiou, Plionis, Basilakos & Ragone-Figueroa in prep.),
we obtain that for the soft band and the Ebrero et al. luminosity
function the corresponding halo mass is $M_h \simeq 10^{12.9\pm0.3}\,h^{-1} M_\odot$,
while using the Hasinger luminosity function the corresponding
value is $M_h \simeq 10^{13.2\pm0.3}\,h^{-1} M_\odot$. For the hard band we find that
$M_h \simeq 10^{13.7\pm0.3}\,h^{-1} M_\odot$. Note that using the Sheth et al. bias
model, we find very similar $M_h$ values (for example, for the hard
band results we find $M_h \simeq 10^{13.6}\,h^{-1} M_\odot$).



In this work we use $\Omega_m = 1 - \Omega_\Lambda = 0.3$ and $\sigma_8 = 0.80$.

8. Main conclusions

We have performed a two-point correlation function analysis of
the XMM-LSS sample of point sources that contains in total
94 XMM-Newton pointings (more than five thousand point-like
sources). The observations were made near the celestial equator
at high galactic latitudes over ~ 11 sq. deg. in the soft (0.5-2
keV) and hard (2-10 keV) bands with effective exposures ranging
from 8.1 to 47.3 ks. The minimum flux limits are almost
$10^{-15}$ and $3 \times 10^{-15}$ erg s$^{-1}$ cm$^{-2}$ for the soft and hard bands,
respectively. For the definition of the detection probabilities for
each source and for the proper generation of the mock catalogs
we performed a series of numerical Monte-Carlo simulations of
the XMM-Newton observations. The most important points and
results of our work are listed below.

To deal with the pointing overlap question, we considered 
two approaches: that of a 10' off-axis limitation, and the Voronoi 
delimitation. No major differences were observed in the derived 
point-source correlation function between these two approaches. 
We consequently followed the statistically richer Voronoi delim- 
itation approach, which produces a contiguous field. 

The log N-log S distributions for the soft and hard bands
were found to agree well with the results from the previously
released XMM-LSS catalog (Gandhi et al. 2006). Using the
whole exposure XMM-LSS data, we extended the log N-log S
to lower fluxes, i.e., $10^{-15}$ and $3 \times 10^{-15}$ erg s$^{-1}$ cm$^{-2}$ for the soft
and hard bands, respectively.

The amplitude of the correlation function $w(\theta)$ is significantly
higher in the hard band than in the soft band at the lowest
fluxes. When analyzing a homogeneous 10 ks sample extracted
from the full exposure data, this difference becomes more
prominent. At higher fluxes ($f_x \gtrsim 10^{-14}$ erg s$^{-1}$ cm$^{-2}$) the
amplitude of the correlation function becomes higher in the soft
band. These results provide a bias factor at a median redshift
$z \simeq 1.1$ of ~ 2.5 for the soft band when inverted to 3D (and
for $\epsilon = -1.2$), and at $z \simeq 1$ of ~ 3.3 for the hard band sources.
These bias values correspond to a mass of the halos hosting the
AGN sources of $M_h \simeq 10^{13\pm0.3}\,h^{-1} M_\odot$ for the soft band and
$M_h \simeq 10^{13.7\pm0.3}\,h^{-1} M_\odot$ for the hard band.

The correlation at degree-scale ($\gtrsim$ 3000") nicely extends that
observed on an arcmin scale (100-1000"), a result which is
obtained thanks to the wide contiguous area covered by the survey.

The hard-spectrum sources show a stronger clustering than 
the soft-spectrum ones, especially in the soft band. This hints at 
an environmental dependence of the AGN type. 

The amplitude of the spatial correlation function grows with
flux limit, but mostly in the soft band. In the hard band there is
at most a weak dependence, and a constant value of $r_0 \simeq 10\,h^{-1}$
Mpc is also consistent with the data.